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ABSTRACT 

Most of the multivariate statistical techniques rely 
on the assumption of multivariate normality. The effects of 
non-normality on multivariate tests are assumed to be negligible when 
variance-covar iance matrices and sample sizes are equal. Therefore, 
in practice, investigators do not usually attempt to remove 
non-normality. In this simulation study, the effects of non-normality 
on skewed multivariate data in terms of power were examined by 
manipulating the factors such as distribution, sample size, number of 
variables, and variance-covariance matrix. The number of replications 
was set to 500, and sample sizes of 10, 15, and 20 were used, with 2 
sets of variables, and 2 variance-covariance matrices. The 
multivariate Box-Cox transformation was applied to remove 
non-normality. The power of multivariate analysis of variance 
(MANOVA) was then calculated after the transformation. The results 
were compared with the power calculated before the multivariate 
Box-Cox transformation was applied. In conclusion, even when 
variance-covariance matrices and sample sizes were equal, small to 
moderate increases in power were observed. (Author/SLD) 
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Abstract 



Most of the multivariate statistical techniques rely on the assumption of 
multivariate normality. The effects of nonnormality on multivariate tests are 
assumed to be negligible when variance-covariance matrices and sample sizes 
equal. Therefore, in practice, investigators usually do not attempt to remove 
nonnormality. 

In this simulation study , the ef fects of nonnormal ity on skewed 
multivariate data in terms of power were examined by manipulating the factors 
such as distribution, sample size, number of variables, and variance-<x>variance 
matrix. The multivariate Box-Cox transformation was applied to remove 
nonnormality. The power of MANOVA was then calculated after the transformation. 
The results were compared with the power calculated before the multivariate Box- 
Cox transformation applied. In conclusion, even variance-covariance matrices and 
sample sizes were equal, small to moderate increases in power were observed. 
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Introduction 

Most of the univariate and multivariate statistical techniques assume 
additivity, homogeneity of variances, normality, and independence of 
observations. In practice, it is difficult to attain all these assumptions 
simultaneously. Therefore, most investigators face the issue of assumption 
violations in their research. To counter the issue, Tukey (1977) proposed two 
basic solutions to deal with possible violations of the first three assumptions: 
(a) to employ a non-linear or linear transformation to data to meet the 
assumptions, or (b) to develop a new statistical technique that fits data better. 

Ihe first option generally gets higher acceptance in practice because the 
second option involves greater investment of time and effort. Therefore, 
numerous transformation techniques have been extensively studied and reviewed by 
several researchers for univariate cases (eg., Hoyle, 1973). Furthermore, the 
effects of violations on normality, additivity, and homogeneity of variances were 
also investigated (e.g., Box, 1954; Tiku, 1971; Harwell, Rubinstein, Hayes & 
Olds, 1992) . Fortunately, the tests developed for univariate linear models are 
quite rctoust to violations of assumptions in most of the cases (Glass, Peckman 
& Sanders, 1972) c However, the violations of assumptions in multivariate case 
have not been studied as extensively as those in univariate case. Mardia (1971) 
examined the effects of nonnormality on multivariate regression tests and one-way 
MANOVA. He concluded that when variarK^e-covariance matrices and group saitple 
sizes were equal, the effects of nonnormality on multivariate tests were 
considerably negligible. However, Mardia (1971) did not investigate the 
robustness of the multivariate general linear models in depth for different 
distributicsral asscmptions. Therefore, the effects of nonnormality on the 
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multivariate general linear models deserve a further study. 

Ihe purpose of this simulation stuiy was to investigate the effectiveness 
of the multivariate Box-Cox transformation in normalizing the distribution of 
multivariate data and its effect on the power of MANOVA under various sample 
sizes, number of variables, variance-covariance structures, and distributional 
assumptions. 

Theoretical Perspective 

Box and Cox (1964) have suggested a family of transformation to normalize 
observations, to stabilize variance, and linearize the relationship between 
dependent and independent variables in regression. The notable examples of this 
family of transformations are (a) square-root transformation to stabilize 
variance arri to remove non-normality, (b) cube-roct transformation to remove 
nonnormality, and (c) logarithmic transformation to stabilize variance and to 
remove ronnormality. 

Box and Cox considered a family of transformations for x>0 1 

x ; -l 

y(X) = if X*0 

X 

= log e x if X=0 

to simultaneously satisfy all three assunptions. The coefficient X can be 
estimated by using the maximum likelihood method. The maximum likelihood 
estimate of X maximizes the likelihood function L(X) . Furthermore, to test 
whether maximum likelihood estimate X is statistically equal to 1, that denotes 
a normality, the following likelihood ratio test have been proposed: 

1 The Box-Cox power transformation technique can also be used for both positive and negative numbers if 
y<l)=(x-C> is replaced for x . The joint maximum likelihood estimates of ( and X then will be estimated by 
maximizing Likelihood function of <C#D« 
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2{WX)-W1)} * X'i<«> 



where x J ^ (<*) denote the upper 100a% point of % 2 with 1 degree of freedom. To 
avoid the correlation effect between X and 6 where E(y(X))=x8, X is a vector of 
observations of x's and 8 is an unknown parameter, Box and Cox (1982) later 
modified the above transformation and proposed to use the following formula: 

x X -l 

y(X) = if X*0 

= xlog e x if X=0 

where x is a geometric mean of all observations. Box, Hunter and Hunter (1978) 
provided an example how to use the Box-Cox transformation for univariate case 
with a graphical demonstration. Since it requires an extensive calculations, 
Hinkley (1977) and Emerson and Stoto (1982) suggested to estimate X by using 
transformation plot for symmetrizing and straightening the relationship between 
dependent and independent variables. Hines and Hines (1987) also presented a 
chart to calculate X approximately. 

Andrews, Gnanadesikan and Warner (197 3) and Hernandez and Johnson (1980) 
generalized the Box-Cox transformation to multivariate data and its significance 
test for X=l or (X 1# . . ,X p )=(l, . . ,1) where p is a number of variables. For each 
given X, the transformation can be defined as 

= lcg^ij if XpO 

where i=l, . . ,n (n=number of observations) , j=l, . . ,p (p=number of variables) and 
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XjjX). The maximum likelihood estimate of X is calculated by maximizing 

L(l) = (-ry2)log|2|+{S jalrP (; r l)S falrn logy fj } 

where 2 can be replaced by its maximum likelihood estimate. The corresponding 
significance test then is 

2{i tex a)-wi)} * xyoo (i) 

that is the same as univariate case except x 2 distribution has p degrees of 
freedom, where p corresponds to the rammer of variables. Rode and Chinchilli 
(1988) suggested to use the NewtonHRaphson iterative algorithm to obtain the 
maximum likelihood estimate of A. 

Mathod 

Data were generated from two independent populations with equal variance- 
covariance matrices and sample sizes. Hcwever, their corresponding group mean 
vectors were assumed to be different by a .5a (medium effect size) (Cohen, 1988) 
such that /x 1(i) -/x 2(i) =.5cr n where 1 and 2 denoted group membership and i stood for 
a variable i. Type I error rate was set to 0.05 throughout the study. The number 
of replications were limited to 500. 

The following factors were manipulated: 

(a) Three sample sizes were examined, ^=^=10, 15, 20. 

(b) Two sets of variables were enployed, p=2 or p=3. 

(c) Two variance-covariance matrices were specified: 

Case 1 . 2^1^=2 where variables were uncorrelated, PjfO, i*j,for all i and j. 
Case 2 . 2^=22=2 where variables were correlated such that p 12 =.60, p 13 =.55, and 
p 2 3=«35. 

(d) The following combinations of distributions were specified: 

Case 1 . When p=2, variable 1 was highly skewed and moderately leptokurtic 
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(skewness=1.75 and kurtosis=4.00) and variable 2 was moderately skewed and 
sli(#itly leptokurtic (skewness=1.00 and kurtosis=1.00) . 

Case 2 . When p=3, variable 1 and variable 2 were specified as in case 1. Variable 
3 was highly skewed and highly leptokurtic (skewness=2.25 and kurtosis=6.50) . 

Multivariate nonnormal and skewed data were generated by using GENRAW2 
(Joreskog and Sorbcm, 1989) and ITMGNR (Kirisci, 1992) computer programs. The 
maximum likelihood estimates of X were obtained by using a Fortran IV corputer 
program written by the authors. The maximum likelihood estimate of X was 
calculated 500 times for each combination. The average value along with its 
standard deviation score were presented. Furthermore, the significance level of 
the likelihood ratio test statistic for normality (X=l, see equation 1) was 
caiputed and its average significance level was reported. Power and noncentrality 
parameter for MANOVA were ccmputed by using SPSSX and their average scores were 
presented. Power and noncentrality were calculated by utilizing the following 
formulas (Morrison, 1976): 

Power = l-/3(5 2 ) = ^ % >^ a;?t inU^u^) ' 
where F' is a noncentral F distribution and 

Noncentrality parameter = <S 2 = (n,+n 2 ) (MrM 2 ) 'S'Vr/^) • 
where s" 1 is a inverse matrix of variance-covariance matrix 2. 

Finally, chi-square probability plots of squared radii, that was proposed 
by Andrew, Gnanadesikan and Warner (1971, 1973) , were drawn by AXUM (TriMetrix, 
1992) . To draw a chi-square probability plot, Mahalonobis distances (D } ) were 
calculated and ranked frtro the smallest to the largest. Theoretical chi-square 
scores were obtained for each 100(j-.5)/n percentile from GUDF (IMSL, 1989) , 
where n was a saitple size and j varied between 1 and n. As a final step, points 
(Dj, 100 (j-. 5) /n)) were plotted in two-dimensional space. The plot suggests 
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normality, if all points lie closer to a 45-degree straight line. 

Results 

In this simulation study, the effects of nonnormality on skewed 
multivariate data in terms of power were examined. The factors such as 
distribution, sample size, and number of variables were manipulated in order to 
decide whether normality assumption was crucial in MANOVA when the number of 
observations per variable ranged between ar:vr ate values. Since the variance- 
covaricnoe matrices and sairple sizes were assumed equal throughout the study, we 
anticipated that ixtprovert^nts in pcwer after' the transformation should be 
relatively small, if MANOVA is robust to the violation of nonnormality 
assumption. 

According to the results summarized in Table 1, the multivariate Box-Cox 
transformation had a notable effect on the pcwer of the test as well as on the 
noncentrality parameter. The smallest increase in power ( . 09%) was observed when 
2 variables were correlated and the sample size was 15 for each grot?). The 
highest increase (26.1%) was attained when 2 variables were uncorrelated and the 
saitple size was 20 for each group. 



Please insert Table 1 here 



To place a broader perspective the effects of the multivariate Box-Cox 
transformation on the power of MANOVA, it may be useful to examine the maximum 
likelihood estimates of X. Correlated and uncorrelated cases should be examined 
separately, since the maximum likelihood estimates can be affected if variables 
are correlated. When the number of variables was 2 and they were uncorrelated, 
the first variable that was specified as highly skewed and moderately leptokurtic 
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took higher values than those of the second variable that was moderately skewed 
and slightly leptokurtic. The maximum likelihood estimates cxjrresponding to the 
first variable suggested that, on the average, the maximum likelihood estimate 
of X was .16. For the second variable, the values was closer to zero. In 
practice, this may suggest that a logarithmic transformation was appropriate for 
the second variable that was moderately skewed and slightly leptokurtic. The 
results of the likelihood ratio test for normality and their average significance 
level indicated that when sample sizes was 10, the transformation for normality 
was successful (nonsignificant result) at .05 level. When sample size increased 
from 10 to 20, the average significance level dropped to 0.0001. Since x 2 -test 
are very sensitive to number of observations and outliers, this is ususally the 
case in goodness-of-fit tests. When the number of variables were 3 and 
uncorrelated, the maximum likelihood estimates of X were, on the average, 0.10 
for the first variable, 0.20 for the second variable, and 0.56 for the third 
variable that was highly skewed and highly leptokurtic. This suggested that to 
remove nonnormality, X in Box-Cox transformation should be set to 0.10 for the 
first variable, 0.20 for the second variable and 0.55 for the third variable 
(square-root transformation) . The average significance levels followed the same 
pattern as before; it dropped from 0.0034 to 0.0000. 

When the number of variables were 2 and correlated, the maximum likelihood 
estimates of X corresponding to the first variable were closer to 0 that denoted 
a logarithmic transformation . For the second variable, it was closer to 0.30, 
a cube-root transformation was appropriate. The significance levels changed from 
0.0281 to 0.0004. When the number of variables were 3, the maximum likelihood 
estimates of X corresponding to the first variable suggested a square-root 
transformation. The second variable was transformed to remove nonnormal ity 
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by setting X=1.17. The third variable required a logarithmic transformation. 

The effects of Box-Cox tranr formation on multivariate data can be observed 
by examining scire of the selected chi~square probability plots that were 
presented in Figure 1, Figure 2, and Figure 3. As can be noted, if multivariate 
normality holds, all the points to be expected lie on the 4 5 -degree straight 
line. By considering that:, it is possible to visualize the effects of the 
multivariate Box-Cox transformation on normality. 



Please insert Figure 1, Figure 2 and Figure 3 here 



Conclusion 

In educational research and as well as in behavioral sciences , 
investigators heavily rely on multivariate statistical techniques in analyzing 
multivariate data with complex structures. Multivariate normality is one of the 
key assumptions that underlies much of the classical multivariate statistical 
techniques. Therefore, to maet this assumption gains greater importance in data 
analysis. 

In this study, our attention focused on the violation of normality 
assumption for multivariate data and its effect on pcwer. The multivariate Box- 
Cox transformation technique was applied to nonnormal and skewed multivariate 
data. Power and noncentrality parameter were calculated for MSNOVA in order to 
show the effects of the multivariate Box-Cox transformation. lastly, a chi- 
square probability plot of squared radii was employed to transform multivariate 
data into unidimensional space to view how successful the multivariate Box-Cox 
transformation technique in achieving normality. 

Although the multivariate Box-Cox transformation is laborious and time 
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consuming, its drawbacks are only a small price to pay for the benefits that can 
be obtained from the information we gain. A computerized literature survey of 
articles in psychology and behavioral sciences since 1988 using PSYCHINFQ, 
produced by the American Psychological Association, shews that transformation of 
multivariate data for normality has almost never been employed in practice. 
Instead, most researchers assume that under the assumptions of equal variance- 
cavariance matrices and of equal group sarple sizes, the violation of normality 
has a small effect on the power of MANOVA. However, the results of this study 
showed that even if two groups had same variarK^covariance matrices and equal 
sample sizes, small to moderate increases in pewer were observed. Therefore, It 
would be highly advisable to apply the multivariate Box-Cox transformation 
technique to multivariate data to remove non-normality. 
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Figure 1 . 

Chi-Squared Probability Plot of Squared Radii 
Uncorrected Variables. df=3. n«40 
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Figure 2. 

Chi-Squa red Probability Plot of Squared Radii 
Correlated Variables. df=2, n~40 
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Figure 3. 

Chi-Squared Probability Plot of Squared Radii 
Correlated Variables. df=3, n=40 
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